Categorical classifiers in multiclass classification with imbalanced datasets
نویسندگان
چکیده
This paper discusses, in a multiclass classification setting, the issue of choice so-called categorical classifier, which is procedure or criterion that transforms probabilities produced by probabilistic classifier into single category class. The standard Bayes Classifier (BC), but it has some limits with rare classes. studies performance BC versus two alternatives, are Max Difference (MDC) and Ratio (MRC), through an extensive simulation case studies. results show both MDC MRC preferable to setting imbalanced data.
منابع مشابه
Classification in Imbalanced Datasets
In this thesis we study the classification task in the presence of class imbalanced data. This task arises in many applications when we are interested in the under-represented (minority) classes. Examples of such applications are related to fraud detection, medical diagnosis and monitoring, text categorization, risk management, information retrieval and filtering. Although there exist many stan...
متن کاملImbalanced Datasets: from Sampling to Classifiers
Classification is one of the most fundamental tasks in the machine learning and data-mining communities. One of the most common challenges faced when trying to perform classification is the class imbalance problem. A dataset is considered imbalanced if the class of interest (positive or minority class) is relatively rare as compared to the other classes (negative or majority classes). As a resu...
متن کاملCo-Multistage of Multiple Classifiers for Imbalanced Multiclass Learning
In this work, we propose two stochastic architectural models (CMC and CMC-M ) with two layers of classifiers applicable to datasets with one and multiple skewed classes. This distinction becomes important when the datasets have a large number of classes. Therefore, we present a novel solution to imbalanced multiclass learning with several skewed majority classes, which improves minority classes...
متن کاملHybrid classification approach for imbalanced datasets
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vi CHAPTER
متن کاملUsing Two-Class Classifiers for Multiclass Classification
The generalization from two-class classification to multiclass classification is not straightforward for discriminants which are not based on density estimation. Simple combining methods use voting, but this has the drawback of inconsequent labelings and ties. More advanced methods map the discriminant outputs to approximate posterior probability estimates and combine these, while other methods...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Statistical Analysis and Data Mining
سال: 2023
ISSN: ['1932-1864', '1932-1872']
DOI: https://doi.org/10.1002/sam.11624